A low-complexity algorithm for calculating the LMMSE filter
coefficients for GFDM in a block-fading multipath environment is
derived in this letter. The simplification is based on the block circularity
of the involved matrices. The proposal reduces the complexity from cubic
to quadratic order. The proposed approach can be generalized to other
waveforms with circular pulse shaping.


Notation: Matrices are typeset in bold, $\otimes$ denotes the Kronecker
product, $\mathbf{I}_N$ denotes the $N$-dimensional identity matrix and
$\mathbf{F}_N$ is the unitary $N$-point DFT matrix. $\langle x \rangle_N$
denotes the remainder of $x$ modulo $N$. $(\cdot)^T$ and $(\cdot)^H$ denote
matrix transpose and Hermitian conjugate, respectively. $\mathrm{diag}(\ldots)$
returns a (block-)diagonal matrix with its arguments on the diagonal.


Motivation and Problem Description: Recently, several waveforms for
5G networks have been proposed. Among them, waveforms utilizing
circular pulse shaping structure the signal into self-contained
blocks that can be separated by a cyclic prefix (CP) to combat inter-block
interference. Generalized Frequency Division Multiplexing (GFDM), the
first 5G waveform that used circular pulse shaping, provides a flexible
time and frequency grid that can be exploited to provide low out-of-band
radiation and robustness to time and frequency misalignments.
For real-world implementations, low-complexity algorithms are always
of concern. For GFDM, the literature provides low-complexity descriptions
of the linear GFDM modulator and demodulator, and of the design
of zero-forcing and LMMSE filters for additive white Gaussian noise
(AWGN) channels [5]. However, no low-complexity implementation of
the LMMSE demodulator for multipath environments is available. This
letter presents an algorithm with significantly reduced complexity for
calculating the LMMSE filter coefficients for GFDM.


The time domain signal of one GFDM block is given by

$$x[n] = \sum_{k=0}^{K-1} \sum_{m=0}^{M-1} d_{k,m}\, g_{k,m}[n], \qquad g_{k,m}[n] = g[\langle n - mK \rangle_N] \exp\left(j 2\pi \frac{kn}{K}\right), \tag{1}$$

where $N = KM$, $n = 0, \ldots, N-1$, $g[n]$ denotes the prototype transmit
filter and $d_{k,m} \in \mathcal{X}$ is the data symbol to be transmitted on the
$k$th subcarrier and $m$th subsymbol, taken from the complex-valued
constellation $\mathcal{X}$. Eq. (1) is written in matrix form as
$\mathbf{x} = \mathbf{A}\mathbf{d}$ with

$$\mathbf{A} = (\mathbf{g}_{0,0} \; \cdots \; \mathbf{g}_{K-1,0} \; \mathbf{g}_{0,1} \; \cdots \; \mathbf{g}_{K-1,M-1}),$$

where the column vectors are $\mathbf{g}_{k,m} = (g_{k,m}[n])_{n=0,\ldots,N-1}$,
$\mathbf{d}$ contains $d_{k,m}$ in the appropriate order and
$E[\mathbf{d}] = \mathbf{0}$, $E[\mathbf{d}\mathbf{d}^H] = \mathbf{I}_N$. $g[n]$ is
band-limited within two subcarriers, i.e. $\mathbf{F}_N\mathbf{g}_{k,0}$ only has
$2M$ nonzero elements centered around the index $kM$ [3]. The signal is transmitted
through a block-fading wireless multipath channel with impulse response 
h. Assuming a CP between blocks that is longer than h, the received time 
domain signal per block is given by 


$$\mathbf{y} = \mathbf{H}\mathbf{x} + \mathbf{w} = \mathbf{H}\mathbf{A}\mathbf{d} + \mathbf{w}, \tag{4}$$


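where $\mathbf{H}$ is an $N \times N$ circulant matrix with the zero-padded
channel impulse response $\mathbf{h}$ as its first column and
$\mathbf{w} \sim \mathcal{CN}(\mathbf{0}, \sigma^2\mathbf{I}_N)$ is AWGN.
The LMMSE equalizer for $\mathbf{d}$ is given by
$\hat{\mathbf{d}}_{\mathrm{LMMSE}} = \mathbf{W}^H\mathbf{y}$, where
$\mathbf{W}$ are the LMMSE filter coefficients given by

$$\mathbf{W} = \mathbf{H}\mathbf{A}\left((\mathbf{H}\mathbf{A})^H(\mathbf{H}\mathbf{A}) + \sigma^2\mathbf{I}_N\right)^{-1}. \tag{5}$$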
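As a point of reference, the direct evaluation of (5) can be sketched in a few lines of NumPy. This is only an illustrative sketch: the column ordering of the modulation matrix (subcarrier index running fastest), the random placeholder prototype filter, the 3-tap channel, the BPSK symbols and the helper names `gfdm_matrix`, `circulant` and `lmmse_filter` are all assumptions made for the example, not taken from the letter.

```python
import numpy as np

def gfdm_matrix(g, K, M):
    """Modulation matrix A (N x N); column m*K + k holds g_{k,m}[n] (assumed ordering)."""
    N = K * M
    n = np.arange(N)
    # np.roll(g, m*K)[n] equals g[(n - m*K) mod N]
    cols = [np.roll(g, m * K) * np.exp(2j * np.pi * k * n / K)
            for m in range(M) for k in range(K)]
    return np.stack(cols, axis=1)

def circulant(first_col):
    """Circulant matrix with the given first column."""
    return np.stack([np.roll(first_col, j) for j in range(len(first_col))], axis=1)

def lmmse_filter(G, sigma2):
    """W = G (G^H G + sigma^2 I)^{-1}, obtained through one linear solve."""
    N = G.shape[1]
    gram = G.conj().T @ G + sigma2 * np.eye(N)
    # solve() returns gram^{-1} G^H = W^H; gram is Hermitian, so conjugate-transpose gives W
    return np.linalg.solve(gram, G.conj().T).conj().T

K, M, sigma2 = 4, 3, 0.1
N = K * M
rng = np.random.default_rng(1)
g = rng.standard_normal(N) + 1j * rng.standard_normal(N)       # placeholder prototype filter
h = np.zeros(N, dtype=complex)
h[:3] = rng.standard_normal(3) + 1j * rng.standard_normal(3)   # zero-padded 3-tap channel

A = gfdm_matrix(g, K, M)
H = circulant(h)
G = H @ A
W = lmmse_filter(G, sigma2)

d = rng.choice([-1.0, 1.0], size=N) + 0j                       # BPSK symbols for illustration
w = np.sqrt(sigma2 / 2) * (rng.standard_normal(N) + 1j * rng.standard_normal(N))
y = G @ d + w                                                  # received block, cf. (4)
d_hat = W.conj().T @ y                                         # LMMSE estimate
```

The dense solve costs on the order of $N^3$ operations, which is the baseline the remainder of the letter improves on.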

Sec. II reviews properties of block-circulant (BC) matrices. A
low-complexity solution for computing $\mathbf{W}$ in (5) is developed in
Sec. III and its complexity is evaluated in Sec. IV. Sec. V concludes this letter.


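Block-circulant Matrices: Let $\mathbf{X}$ be an $N \times N$ BC matrix composed
of $M$ arbitrary submatrices $\{\mathbf{X}_i\}$ of size $K \times K$ each, i.e.

$$\mathbf{X} = \begin{pmatrix} \mathbf{X}_0 & \mathbf{X}_{M-1} & \cdots & \mathbf{X}_1 \\ \mathbf{X}_1 & \mathbf{X}_0 & \cdots & \mathbf{X}_2 \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{X}_{M-1} & \mathbf{X}_{M-2} & \cdots & \mathbf{X}_0 \end{pmatrix}. \tag{6}$$

$\mathbf{X}$ is block-diagonalized by $\mathbf{Z} = \mathbf{F}_M \otimes \mathbf{I}_K$ [6] such that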

$$\mathbf{Z}\mathbf{X}\mathbf{Z}^H = \mathrm{diag}(\mathbf{D}_{X,0}, \mathbf{D}_{X,1}, \ldots, \mathbf{D}_{X,M-1}), \tag{7}$$

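where $\mathbf{D}_{X,u}$ is the $u$th submatrix of size $K \times K$ on the
diagonal of $\mathbf{Z}\mathbf{X}\mathbf{Z}^H$. Note that $\mathbf{Z}$ performs a
discrete Zak transform on its argument [5]. Let $\mathbf{X}_s$ be the first $K$
columns of $\mathbf{X}$, i.e.
$\mathbf{X}_s = (\mathbf{X}_0^T \; \mathbf{X}_1^T \; \cdots \; \mathbf{X}_{M-1}^T)^T$. Then [6],

$$\mathbf{D}_X = (\mathbf{D}_{X,0}^T \; \mathbf{D}_{X,1}^T \; \cdots \; \mathbf{D}_{X,M-1}^T)^T = \mathbf{Z}\mathbf{X}_s. \tag{8}$$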

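Note the similar behaviour of circulant matrices: Let $\mathbf{Y}$ be a circulant
matrix with $\mathbf{y}$ in the first column, then
$\mathbf{F}_N\mathbf{Y}\mathbf{F}_N^H = \mathrm{diag}(\mathbf{F}_N\mathbf{y})$,
up to the scalar normalization of the DFT. Let

$$\mathbf{Z}_u = \boldsymbol{\omega}_u \otimes \mathbf{I}_K \tag{9}$$

$$\text{with} \quad \boldsymbol{\omega}_u = (1 \;\; \omega^{u} \;\; \cdots \;\; \omega^{(M-1)u}) \tag{10}$$

and $\omega = \exp(-j\frac{2\pi}{M})$ such that
$\mathbf{D}_{X,u} = \mathbf{Z}_u\mathbf{X}_s$. Note that, according to the
above block diagonalization, the product and sum of two BC matrices and the
inverse of a BC matrix are again BC.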
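The properties reviewed in this section can be checked numerically. The sketch below builds a random BC matrix, verifies that $\mathbf{Z}\mathbf{X}\mathbf{Z}^H$ is block diagonal, that its diagonal blocks equal $\mathbf{Z}_u\mathbf{X}_s$, and that products and inverses stay BC. The helper names and the test dimensions are hypothetical. One assumption is worth flagging: $\mathbf{Z}$ is built from the unitary DFT, while $\boldsymbol{\omega}_u$ is taken without the $1/\sqrt{M}$ factor, which is the combination for which the block identities hold exactly.

```python
import numpy as np

def block_circulant(blocks):
    """N x N BC matrix whose first block column is blocks[0], ..., blocks[M-1]."""
    M = len(blocks)
    return np.block([[blocks[(i - j) % M] for j in range(M)] for i in range(M)])

def is_block_circulant(Y, K):
    """True if shifting Y by K rows and K columns (cyclically) leaves it unchanged."""
    return np.allclose(np.roll(Y, (K, K), axis=(0, 1)), Y)

K, M = 3, 4
rng = np.random.default_rng(0)
blocks = [rng.standard_normal((K, K)) + 1j * rng.standard_normal((K, K))
          for _ in range(M)]
X = block_circulant(blocks)
Xs = X[:, :K]                                   # first K columns

F_M = np.fft.fft(np.eye(M)) / np.sqrt(M)        # unitary M-point DFT matrix
Z = np.kron(F_M, np.eye(K))
D = Z @ X @ Z.conj().T                          # block diagonal, cf. (7)

def Z_u(u):
    """Z_u = omega_u (x) I_K with the unnormalized row omega_u, cf. (9)-(10)."""
    omega_u = np.exp(-2j * np.pi * u * np.arange(M) / M)
    return np.kron(omega_u[None, :], np.eye(K))

diag_blocks = [D[u * K:(u + 1) * K, u * K:(u + 1) * K] for u in range(M)]
block_mask = np.kron(np.eye(M), np.ones((K, K)))
off_diag = D * (1 - block_mask)                 # everything outside the diagonal blocks
```

With the unitary $\mathbf{Z}$, the stacked product $\mathbf{Z}\mathbf{X}_s$ differs from these diagonal blocks only by the scalar $1/\sqrt{M}$.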

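Reduced Complexity Filter Calculation: By definition, $\mathbf{A}$ is BC
with $M$ blocks of size $K \times K$ and is hence block-diagonalized by $\mathbf{Z}$.
Also, circularity of $\mathbf{H}$ implies block circularity. Accordingly,
$\mathbf{G} = \mathbf{H}\mathbf{A}$, $(\mathbf{G}^H\mathbf{G} + \sigma^2\mathbf{I}_N)$
and $\mathbf{W}$ are all BC matrices. Hence, $\mathbf{W}$ is completely
defined by its diagonalization $\mathbf{D}_W = \mathbf{Z}\mathbf{W}_s$. The $M$
blocks of $\mathbf{D}_W$ are given by the $M$ equation systems

$$\mathbf{D}_{W,u} = \left((\mathbf{D}_{G,u}^H\mathbf{D}_{G,u} + \sigma^2\mathbf{I}_K)^{-1}\mathbf{D}_{G,u}^H\right)^H \tag{11}$$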

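where $u = 0, \ldots, M-1$ and $\mathbf{D}_{G,u} = \mathbf{Z}_u\mathbf{G}_s$. Now,
$\mathbf{Z}_u^H\mathbf{Z}_u = (\boldsymbol{\omega}_u^H\boldsymbol{\omega}_u) \otimes \mathbf{I}_K$
is a circulant matrix since
$(\boldsymbol{\omega}_u^H\boldsymbol{\omega}_u)_{i,j} = \omega^{u(j-i)}$
is circulant, and accordingly
$\mathbf{D}_u := \mathbf{F}_N\mathbf{Z}_u^H\mathbf{Z}_u\mathbf{F}_N^H$ is a diagonal matrix.
Due to the band-limitation of $g[n]$, only adjacent subcarriers overlap and
$(\mathbf{F}_N\mathbf{G}_s)^H\mathbf{D}_u\mathbf{F}_N\mathbf{G}_s$ is a tridiagonal
matrix with periodic boundary conditions. Once $\mathbf{D}_W$ is known, the first
$K$ columns of $\mathbf{W}$ in the time domain are given by

$$\mathbf{W}_s = \mathbf{Z}^H\mathbf{D}_W \tag{12}$$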

and the remaining columns are given as circular shifts of $\mathbf{W}_s$. In
addition, the Zak-domain $\mathbf{D}_W$ can also be directly transformed into the
frequency domain to readily employ a low-complexity receiver.
Furthermore, LMMSE filtering can even be directly performed in the
Zak domain by

$$\mathbf{Z}\hat{\mathbf{d}}_{\mathrm{LMMSE}} = \mathrm{diag}(\mathbf{D}_{W,0}, \mathbf{D}_{W,1}, \ldots, \mathbf{D}_{W,M-1})^H\,\mathbf{Z}\mathbf{y}. \tag{13}$$
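The block-wise procedure of (11) to (13) can be sketched as follows and compared against the direct solution of (5). The sketch makes several assumptions: $\mathbf{G}$ is a random BC matrix rather than an actual $\mathbf{H}\mathbf{A}$, the $K \times K$ systems are solved densely (the tridiagonal structure only appears for a band-limited prototype filter), and the block transforms use NumPy's unnormalized FFT with the matching inverse, so the scaling of $\mathbf{Z}$ cancels. The function name `lmmse_block_circulant` is hypothetical.

```python
import numpy as np

def lmmse_block_circulant(Gs, K, M, sigma2):
    """Return W_s (first K columns of W) and the blocks D_{W,u} for BC G."""
    G_blocks = Gs.reshape(M, K, K)                  # G_0, ..., G_{M-1}
    D_G = np.fft.fft(G_blocks, axis=0)              # D_{G,u} = Z_u G_s for each u
    D_G_h = np.conj(np.swapaxes(D_G, -1, -2))       # D_{G,u}^H
    gram = D_G_h @ D_G + sigma2 * np.eye(K)
    D_W_h = np.linalg.solve(gram, D_G_h)            # M independent K x K systems
    D_W = np.conj(np.swapaxes(D_W_h, -1, -2))       # outer Hermitian of (11)
    W_blocks = np.fft.ifft(D_W, axis=0)             # back to the time domain, cf. (12)
    return W_blocks.reshape(M * K, K), D_W

K, M, sigma2 = 3, 4, 0.5
N = K * M
rng = np.random.default_rng(4)
G_blocks = rng.standard_normal((M, K, K)) + 1j * rng.standard_normal((M, K, K))
G = np.block([[G_blocks[(i - j) % M] for j in range(M)] for i in range(M)])

Ws, D_W = lmmse_block_circulant(G[:, :K], K, M, sigma2)

# Reference: direct evaluation of (5) on the full N x N matrix
W_direct = G @ np.linalg.inv(G.conj().T @ G + sigma2 * np.eye(N))

# Filtering performed directly in the Zak domain, cf. (13)
y = rng.standard_normal(N) + 1j * rng.standard_normal(N)
D_y = np.fft.fft(y.reshape(M, K), axis=0)           # block transform of y
D_d = np.einsum('uji,uj->ui', D_W.conj(), D_y)      # D_{W,u}^H applied to each block
d_hat_zak = np.fft.ifft(D_d, axis=0).reshape(N)
```

In this sketch both `Ws` and `d_hat_zak` agree with the dense reference to numerical precision, while only $M$ systems of size $K \times K$ are solved.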

Complexity Analysis: In this section, the arithmetic complexity of the
proposed algorithm is evaluated, considering one complex multiplication
as one $\mathcal{O}(1)$ operation and neglecting other operations such as additions.
The product $\mathbf{Z}\mathbf{x}$ is equivalent to $K$ DFTs of length $M$ each, and
hence requires $KM \operatorname{ld} M$ operations*, yielding
$K^2 M \operatorname{ld} M$ operations for $\mathbf{Z}\mathbf{X}_s$.
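The equivalence between the product $\mathbf{Z}\mathbf{x}$ and $K$ DFTs of length $M$ can be illustrated with a short NumPy sketch. The dimensions and variable names are arbitrary choices for the example; the unitary scaling $1/\sqrt{M}$ is applied explicitly because `np.fft.fft` is unnormalized.

```python
import numpy as np

K, M = 4, 8
N = K * M
rng = np.random.default_rng(2)
x = rng.standard_normal(N) + 1j * rng.standard_normal(N)

# Dense reference: Z = F_M (x) I_K applied as an N x N matrix product
F_M = np.fft.fft(np.eye(M)) / np.sqrt(M)
Z = np.kron(F_M, np.eye(K))
dense = Z @ x

# Fast version: view x as an M x K array and run K length-M FFTs down the columns
fast = (np.fft.fft(x.reshape(M, K), axis=0) / np.sqrt(M)).reshape(N)
```

Applying the same reshaping to each of the $K$ columns of $\mathbf{X}_s$ gives the $K^2 M \operatorname{ld} M$ count stated above.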

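To compute $\mathbf{F}_N\mathbf{G}_s = \mathbf{F}_N\mathbf{H}\mathbf{A}_s$ we take
advantage of the factorization

$$\mathbf{F}_N\mathbf{G}_s = \mathbf{F}_N\mathbf{H}\mathbf{F}_N^H\,\mathbf{F}_N\mathbf{A}_s, \tag{14}$$

where $\mathbf{F}_N\mathbf{A}_s$ is precalculated at the receiver. This is done by a
DFT for the first column. Other columns are given by circular shifts since
$\mathbf{A}_s$ contains frequency shifts of $g$. Considering the band-limitation of
$g[n]$, the multiplication of $\mathbf{F}_N\mathbf{A}_s$ with the diagonal matrix
$\mathbf{F}_N\mathbf{H}\mathbf{F}_N^H$ takes $2MK$ operations. The diagonal of
$\mathbf{F}_N\mathbf{H}\mathbf{F}_N^H$ is assumed to be available from
previous channel estimation procedures. The product
$\mathbf{G}_s^H\mathbf{F}_N^H\mathbf{D}_u\mathbf{F}_N\mathbf{G}_s$
requires $3K \cdot 4M$ operations due to the tridiagonal structure of the result
and the band structure of $\mathbf{F}_N\mathbf{G}_s$.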
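The factorization (14) and the shift structure of $\mathbf{F}_N\mathbf{A}_s$ can be verified numerically. In the sketch below the prototype filter and the channel taps are random placeholders (so the band-limitation that yields the $2MK$ count is not modelled), and the columns of $\mathbf{A}_s$ are assumed to be $g[n]\exp(j2\pi kn/K)$ for $k = 0, \ldots, K-1$. With a unitary $\mathbf{F}_N$, the diagonal of $\mathbf{F}_N\mathbf{H}\mathbf{F}_N^H$ is the unnormalized DFT of the zero-padded channel.

```python
import numpy as np

K, M = 4, 3
N = K * M
n = np.arange(N)
rng = np.random.default_rng(5)
g = rng.standard_normal(N) + 1j * rng.standard_normal(N)       # placeholder prototype filter
h = np.zeros(N, dtype=complex)
h[:3] = rng.standard_normal(3) + 1j * rng.standard_normal(3)   # zero-padded channel

# First K columns of A: frequency-shifted copies of g (subsymbol m = 0)
A_s = np.stack([g * np.exp(2j * np.pi * k * n / K) for k in range(K)], axis=1)
H = np.stack([np.roll(h, j) for j in range(N)], axis=1)        # circulant channel matrix
F = np.fft.fft(np.eye(N)) / np.sqrt(N)                         # unitary N-point DFT

# F A_s: one DFT of g, the other columns are circular shifts by k*M bins
g_f = F @ g
FA_s_shifted = np.stack([np.roll(g_f, k * M) for k in range(K)], axis=1)

# F H F^H is diagonal; its diagonal is the channel frequency response
channel_f = np.fft.fft(h)
FG_s = channel_f[:, None] * FA_s_shifted                       # right-hand side of (14)
```

Once the diagonal is known, the channel enters only through an element-wise scaling of the precomputed columns.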


* Assuming that the $N$-point DFT requires $N \operatorname{ld} N$ operations.
Fig. 1 Number of floating point operations for proposed low-complexity
LMMSE calculation.

The tridiagonal system with periodic boundary conditions

$$(\mathbf{D}_{G,u}^H\mathbf{D}_{G,u} + \sigma^2\mathbf{I}_K)^{-1}\mathbf{D}_{G,u}^H \tag{15}$$

is solved using the Thomas algorithm [8] with $2K$ operations for
factorization and $5K$ operations for solving for each right-hand side,
resulting in $2K + 5K^2$ operations for the full linear system [9]. Finally,
calculation of $\mathbf{Z}^H\mathbf{D}_W$ requires $K^2 M \operatorname{ld} M$ operations.
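A tridiagonal system with periodic boundary conditions can be solved in linear time per right-hand side. The sketch below is one common way to do this: the Thomas recursion combined with a Sherman-Morrison rank-one correction for the two corner entries. The letter does not state which variant it uses, so this choice, the function names and the test matrix are assumptions, and the operation count of this sketch is not claimed to match the $2K + 5K^2$ figure exactly.

```python
import numpy as np

def thomas(lower, diag, upper, rhs):
    """Solve a tridiagonal system; lower[0] and upper[-1] are not used."""
    n = len(diag)
    c = np.zeros(n, dtype=complex)
    d = np.zeros(n, dtype=complex)
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):                       # forward elimination
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = np.zeros(n, dtype=complex)
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):              # back substitution
        x[i] = d[i] - c[i] * x[i + 1]
    return x

def cyclic_thomas(lower, diag, upper, rhs):
    """Same system plus corner entries T[0,-1] = lower[0] and T[-1,0] = upper[-1]."""
    n = len(diag)
    beta, alpha = lower[0], upper[-1]
    gamma = -diag[0]
    mod_diag = np.array(diag, dtype=complex)
    mod_diag[0] -= gamma
    mod_diag[-1] -= alpha * beta / gamma
    u = np.zeros(n, dtype=complex)
    u[0], u[-1] = gamma, alpha
    y = thomas(lower, mod_diag, upper, rhs)     # solve without the corners
    z = thomas(lower, mod_diag, upper, u)       # correction direction
    factor = (y[0] + beta * y[-1] / gamma) / (1 + z[0] + beta * z[-1] / gamma)
    return y - factor * z

# Hermitian, diagonally dominant test matrix mimicking D_G^H D_G + sigma^2 I
n = 6
rng = np.random.default_rng(3)
lower = 0.3 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))
upper = np.conj(np.roll(lower, -1))
diag = 4.0 + rng.random(n)
rhs = rng.standard_normal(n) + 1j * rng.standard_normal(n)

T = np.diag(diag).astype(complex) + np.diag(lower[1:], -1) + np.diag(upper[:-1], 1)
T[0, -1], T[-1, 0] = lower[0], upper[-1]
x = cyclic_thomas(lower, diag, upper, rhs)
```

For the $K$ right-hand sides in (15), the same routine would be called once per column; an implementation aiming for the stated operation count would reuse the elimination coefficients across right-hand sides.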

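Hence, the number of complex multiplications $C_{\mathrm{sparse}}$ to solve (5) with
the proposed method is given by

\begin{align}
C_{\mathrm{sparse}} &= \underbrace{2MK}_{(a)} + M\big(\underbrace{3K \cdot 4M}_{(b)} + \underbrace{2K + 5K^2}_{(c)}\big) + \underbrace{K^2 M \operatorname{ld} M}_{(d)} \tag{16} \\
&= K^2(5M + M \operatorname{ld} M) + K(12M^2 + 4M) \tag{17} \\
&= \mathcal{O}(K^2 M \operatorname{ld} M + K M^2) \tag{18}
\end{align}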


where (a) corresponds to the calculation of $\mathbf{F}_N\mathbf{G}_s$, (b) accounts
for $\mathbf{G}_s^H\mathbf{F}_N^H\mathbf{D}_u\mathbf{F}_N\mathbf{G}_s$, (c) describes
the solution of the tridiagonal system and (d) accounts for
$\mathbf{Z}^H\mathbf{D}_W$. For comparison, direct application
of a conventional Hermitian positive definite solver to (5) requires
$C_{\mathrm{direct}} = \mathcal{O}(N^3)$ operations for the solution step alone†,
comprising the Cholesky decomposition and the backward and forward
substitution for $N$ right-hand sides. An
additional advantage is the reduced memory requirement of the proposed
algorithm, as it suffices to store the $MK^2$ filter coefficients for
$\mathbf{D}_W$ instead of $N^2$ coefficients for $\mathbf{W}$.
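The operation count can be tabulated with a few lines of Python. The sketch evaluates the four terms (a) to (d) of (16) and the closed form (17) and confirms that they agree. The function names are hypothetical, and the cubic value $N^3$ is used only as a reference for the growth rate; it is not the letter's exact expression for $C_{\mathrm{direct}}$.

```python
from math import log2

def c_sparse_terms(K, M):
    """Terms (a)-(d) of (16), in complex multiplications."""
    a = 2 * M * K                      # F_N G_s via the factorization (14)
    b = M * (3 * K * 4 * M)            # tridiagonal Gram matrices, one per u
    c = M * (2 * K + 5 * K ** 2)       # tridiagonal solves, one system per u
    d = K ** 2 * M * log2(M)           # Z^H D_W
    return a, b, c, d

def c_sparse_closed(K, M):
    """Closed form (17)."""
    return K ** 2 * (5 * M + M * log2(M)) + K * (12 * M ** 2 + 4 * M)

K, M = 128, 8
total = sum(c_sparse_terms(K, M))
cubic_reference = (K * M) ** 3         # growth-rate reference only, not C_direct itself
print(f"C_sparse = {total:.0f}, N^3 = {cubic_reference}")
```

For $K = 128$ and $M = 8$ both forms evaluate to 1150976 multiplications.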

Fig. 1 compares the number of complex multiplications required for
the proposed technique and for conventional solving with Cholesky
decomposition for different values of $K$ and $M$. The number of required
operations can be reduced by 4 orders of magnitude for 128 subcarriers.
Conclusion: A low-complexity approach for the calculation of LMMSE
filter coefficients for block-fading multipath channels for GFDM has
been presented. The proposal significantly reduces the complexity of
the design from $\mathcal{O}(K^3M^3)$ to
$\mathcal{O}(K^2 M \operatorname{ld} M + KM^2)$, which results in
a complexity reduction of several orders of magnitude for reasonable
system sizes. Since the technique exploits the block-circulant structure
of the modulation matrix, it can be generalized to other multicarrier
waveforms employing circular pulse shaping.
† I.e., the product $\mathbf{H}\mathbf{A}$ etc. is not considered.